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In this paper we calculate upper bounds on fault tolerance, without restrictions on the overhead 
involved. Optimally adaptive recovery operators are used, and the Shannon entropy is used to 
estimate the thresholds. By allowing for unrealistically high levels of overhead, we find a quantum 
fault tolerant threshold of 6.88% for the depolarizing noise used by Knill which compares well 
to "above 3%" evidenced by Knill. We conjecture that the optimal threshold is 6.90%, based upon 
the hashing rate. We also perform threshold calculations for types of noise other than that discussed 
by Knill. 



I. INTRODUCTION 

In this paper, we calculate the quantum fault toler- 
ant thresholds for the [[7, 1, 3]] and [[23, 1, 7]] CSS codes 
under massive amounts of post-selection. Post-selection 
is the process of performing error detection, and then 
rejecting the result if an error is detected. We optimize 
the recovery operators under concatenation Q to find the 
optimal thresholds. Here there are no restrictions on the 
quantity of overhead involved. Optimizations of the error 
threshold with respect to the overhead will be considered 
in future work. 

Suppose that, given a logical data qubit \tpD) encoded 
into some physical data qubits and a logical ancilla qubit 
\iPa) encoded into some physical ancilla qubits, we wish 
to detect bit flip errors in the data qubits. We apply CNOT 
gates transversally from the data qubits to the ancilla 
qubits, and then measure the ancilla qubits, as shown in 
Fig. [TJ The fact that the syndrome measurement may 
be faulty is included in our analysis. For post-selection, 
we start all over if an error is detected. 

This process works well for CSS codes and bit flip 
qubits. CNOT is encoded as itself acting transversally on 
each pair of qubits. In terms of the logical qubits, the 
result will resemble Fig. O For an arbitrary logical data 



FIG. 1: General bit flip error detection process. If we are 
performing post-selection, we start all over if an error is de- 
tected. 
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qubit IV'd), we use [i/.^) = \+), where \+) = -ij(|0)-f|l)) 
for the logical ancilla qubit. If the logical source qubit is 
an encoded Pauli eigenstate of Z, that is |0) or |1), then 
it is more efficient for the logical destination ancilla qubit 
to also be an encoded Pauli eigenstate of Z. 

We assume the ideal case that the noise on each qubit 
is independent, and that any bit flip error that was sent 
to or was already existing on the destination (ancilla) 
qubits can be detected. We model this by looking at 
each physical data qubit and physical ancilla qubit pair 
separately. However, it is not exactly true that all of 
these errors can be detected; if w is the lowest weight 
(that is, the number of qubits for which the Pauli matrix 
is not the identity on that qubit) of a stabilizer element 
(other than the identity) for the code, then there are 
some cases where a total of w errors in the source or 
destination qubits can result in an undetected stabilizer 
error in the destination qubits. 

We run error detection separately for bit flip errors (X 
or Y) and phase flip errors [Z or Y). One way to do this 
is to apply Hadamard gates in between the error detec- 
tion steps to change bit flip errors into phase flip errors, 
and vice versa. In this paper, we apply post-selection to 
reduce the rate of errors in some ancilla qubits. 

In Sec. ini the noise resulting from a CNOT gate followed 
by measurement of the destination qubit is found. The 
same results are derived using the channel maps 0, [l] of 
quantum codes in Appendix [XI Both of these methods 
assume the same noise on each qubit. 

In Sec. mil we apply these results to a code encoded 
in some specific ancilla qubits that allow for an encoded 
CNOT with error correction using just one CNOT per logical 
data qubit. We again assume the same noise on each 
qubit. 

In Sec. IIVI we consider realistic types of noise, and 
calculate the relevant quantum fault tolerant thresholds. 



FIG. 2: Logical qubits under bit flip error detection 
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In Sec. |Vl we examine how the fact that the noise is not 
the same on each qubit affects the threshold. We use the 
Shannon information entropy [2l| as in 0] to calculate 
the actual thresholds. 

In Sec. IVIl we look at how well the noise is corrected 
when we are below the fault tolerant threshold. 



II. BASIC CALCULATIONS 



Channel Notation 



A qubit has two orthogonal states, |0) and If there 
is a set of states \tpi) with corresponding probabilities of 
occurring pi, the resulting mixed state is represented by 
the density matrix 



We can write this 2 by 2 matrix in terms of the 4 one 
qubit Pauli matrices /, X, Y and Z as 

P^^{I + cxX + cyY + czZ). 

Alternately, one can write this density matrix in vector 
form as 



v{p) 



■ 1 ■ 

cx 

CY 

cz. 



(1) 



which we shall call a density vector. Now we consider the 
map from density matrices to density matrices. These 
maps are of the form 



P- 



They can be written as the channel superoperator matrix 
that acts on the density vectors as 



■ 1 ■ 

Nxi Nxx NxY Nxz 

Nyi Nyx Nyy Nyz 

Nzi Nzx NzY Nzz. 



where the Nca' are all real. 

A unitary matrix U acts upon a density state as p — > 
UpW . We represent it in the channel superoperator form 
as a superoperator 0{U). We represent these in the Pauli 
basis. A Pauli matrix a has the diagonal superoperator 
0{a), where each of the coefficients 0{cr)cr'a' is 1 if u 
and a' commute, and —1 if they anti-commute. For the 
1 qubit Pauli matrices, the superoperators are diagonal 
and are written as 



[1,1, 1,1] 

o(y) = [1,-1,1,-1] 



0(X) = [1, 1,-1,-1] 
= [1,-1,-1,1]' 



(2) 



where we record the diagonal elements in vector form. 
We can write a diagonal channel in terms of the proba- 
bilities Pa of having the Pauli error a as 



[1, Nxx, Nyy, N, 



zz\ 



E 



PaO{o) = [l,a;,y,2;], 



(3) 



which, using Eq. gives 

x = \~ 2{pY +pz) 
y = 1 - 2{px +pz) 
z = 1 - 2{px +Py)- 

B. Measurement channels 



Proposition 1 // we have a pure state v = \'ip) and 
a state p, and we measure the state p in an orthonormal 
basis which includes \^p) , then the probability of measur- 
ing lip) is the fidelity, which is half of the inner product 
of the vectorized density matrices (Eq. Qp; 

Fip,iy)^^{v{p)\v{iy)). 

Proof: p can be written as a linear combination of pure 
states as p — (7«l, ^^"^ ^^'^ probability of 

measuring 1-0) is 

i i 

Now, tr pv = J2i {o-il Wi) for ^-ny basis joi), so 
tr pu^iv p IV') (V-l = p IV') (V-IV-) = (V'l P 1^) = F. 

If 



v{p) 



- 1 ■ 




■ 1 - 


PX 


,v{iy) = 




PY 


VY 


.PZ. 




yz. 



then 



P^ = \ ^P'y^'y = \ {'"iP)H^))- 



From this it follows that: 

Corollary 2 If a noise J\f is applied to the state p, the 
probability of measuring the pure state v is 



^v{v)\MHp)). 



For example, if they are both the pure state |0), p = v 
i(/ + Z), and so 



i [1 1] 



Nil + Niz + Nzi + Nzz 
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TABLE L Measurement probabilities 



Measured state 


Initial state Ai = 1 


Initial state Ai = — 1 


= 1 

Xm = ~1 




Nii-Ni^+N„j-N^„ 


2 


2 

Nij-Nj„-N„i + N„^ 


2 


2 



TABLE II: The noise 7^ = ©(CNOT) o (S ® D) o C'(CNOT) re- 
sulting from S (Si'D conjugated with CNOT. 



Theorem 3 If a noise Af is applied to a quhit, and then 
the qubit is measured in the Pauli operator a basis, which 
yields one oj the Pauli eigenstates \(J+) or\a'^), the cor- 
rect result is obtained by the measurement with probability 



Proof: We have initial state pi and measured state p™, 
which are eigenstates of a, with associated eigenvalues 
\i and Am. Then, from Cor. [2l after a noise M occurs, 
the probability of measuring the initial state pi as the 
measured state Pm is 



Nil + XiNi„ + \2N„i + \i\2N„ 



Tab. [J gives the probabilities for the 4 cases in which the 
initial eigenvalue is either —1 or +1 and the measured 
eigenvalue A™ is either —1 or +1. 

The total probability of a wrong measurement averages 
and the total probability of a correct measure- 



'Nil Ni,' 




'l' 


N„i N„a_ 







to 



1-N„ 



ment averages to 



C. Noise from applying CNOT 

As shown in Fig. [31 we have a source qubit \ip)s with 
pre-existing local noise 5, and a destination qubit ^ 
with pre-existing local noise D. When we apply CNOT, 
the two qubit noise Q occurs on these qubits. The total 
superoperator for this process is found by multiplying the 
initial noise S^V first by the superoperator for CNOT and 
then by the new noise Q from CNOT. This results in 

r ^ Q o oicmi) o [s ® D), 

where o represents the matrix multiplication of superop- 
erators. Since we wanted only CNOT to be applied, the 
total noise can be represented in terms of a noise super- 
operator N as 

T = 7VoC'(CN0T) = QoO(CN0T)o(5®i:')oC'(CN0T). (4) 



cr TZrrI 


TiaX 


TZaY 


TlaZ 


I SiDl 


SiDx 


SzDy 


SzDz 


X SxDx 


SxDi 


SyDz 


SyDy 


Y SyDx 


SyDi 


SxDz 


^X^Y 


Z SzDi 


SzDx 


SiDy 




TABLE III: The total noise M resulting from the 


error detec- 


tion process. 








cr Mai 


Max 


MaY 


MaZ 


I SiDiQii 


SiDxQix 


SzDyQiy 


SzDzQiz 


X SxDxQxi 


SxDiQxx 


SyDzQxy 


SyDyQxz 


Y SyDxQyi 


SyDiQyx 


SxDzQyy 


SxDyQyz 


Z SzDiQzi 


Sz DxQzx 


SiDyQzy 


SiDzQzz 



maps a density matrix I + a ^ I + N„a. The noise on 
the source qubits has diagonal noise S" = [1, Sx, Sy, Sz], 
and the noise on the destination qubits has diagonal noise 
D=[1,Dx,Dy,Dz]. 

If the off-diagonal terms are small, they can be ne- 
glected anyway. If they are o(e), the threshold is only 
affected by o(e'"), where m is the minimum weight of 
the non- identity elements that stabilize the code For 
the [[7,1,3]] CSS code, to = 4. For the [[23,1,7]] CSS 
code, TO = 8. 

To determine how the CNOT gate effects the errors, we 
look at the map of the Pauli matrices under conjugation 
by CNOT. This gives the map 

(//), {IX), lY ^ ZY, IZ ^ ZZ, XI ^ XX 
XY ^ YZ, XZ ^ YY, YI ^ YX, (ZI), (ZX). 

The diagonal noise 7^ = O(CNOT) o {S ^ D) o O(CNOT) is 
given by Table[TTl From this, the resulting diagonal noise 
A/" = Q o 7^ is given in Table [ml 



1. After Z measurement on destination qubit 

Now we assume that the destination qubit is set up so 
that we can detect a Z error upon measuring it in the Z 



FIG. 3: A pair of physical qubits under error detection. Pre- 
existing noise on the source qubit is given by 5, pre-existing 
noise on the destination qubit by T>. The noise from the CNOT 
gate is Q. 



D. Diagonal noise 

In the rest of this paper, we assume that the noise is di- 
agonal. We utilize the shorthand Ma = M'aa ■ This noise 



Ms —[5 



Q 



4 



basis. From Theorem|3l we have two types of syndromes, 
the "correct" one which corresponds to no error detected, 
and the "incorrect" one which corresponds to an error 
detected. These have probabihties and ^-j^. By 
tracing out N on the destination qubit, these result in 



Then Eq. [7| becomes 



This gives 



-•IX 



-,11 



1 



(.4 + B) and g 



IX 



1 



(5) 



[A-B), (6) 



where 



A = [1, SxDxQxi. SyDxQyi, SzQzi] 
B - [SzDzQiz, SyDyQxz, SxDyQyz, DzQzz] 

are the 1st and 4th columns of Tab. IIIIl respectively. 



2. After X measurement on source qubit and Z 
measurement on destination qubit 

Like before, we trace out, this time with Tracing 
out Eq. [SI we get probabilities of Pauli errors 



Pi 




px 


1 


Py 


^ 4 


Pz. 





1111 

11-1-1 
1-11-1 
1-1-1 1 



1 

SxDxQxi 
SyDyQxz 
SzDzQiz 



(7) 



Note that only bit flip errors on the destination qubit and 
phase flip errors on the source qubit contribute to these 
errors. 



E. Measurement errors 

Suppose that we have probability Pm of an X measure- 
ment error. This is equivalent to probability px = Pm of 
X error and probability pj = 1 — pm of an / error (no 
error) right before measurement. From Eq. [2l this has 
the channel 

[1, X, y, z]^{l~ p„,)0{I) + PmO{X) = [1, 1, m, m], 
where 

m = 1 - 2pm. (8) 

Similarly, Z measurement errors produce the channel 
[^,x,y,z\ = [l,TO,m,l]. 

This changes the resulting noise after measurement 
from Eq. [6] to 

= i ( A + mS) and ^ = ^{A ^ mB) . 



PI 




"l 


1 


1 


1 " 




1 


Px 


1 


1 


1 


-1 


-1 




mSxDxQxi 


Py 


^ 4 


1 


-1 


1 


-1 




tu^SyDyQxz 


Pz. 




1 


-1 


-1 


1 




mSzDzQiz 



III. SPECIAL ANCILLA QUBITS 

Suppose that we have a high distance (minimum 
weight of an undetected error) code. We have found an- 
cilla qubit encodings that allow for a higher quantum 
fault tolerant threshold than other methods [16]. These 
ancilla qubits are logical qubits encoded in a high dis- 
tance quantum code. We then use quantum teleportation 
to bring in logical data qubits. 

Once we have sufficiently reduced the rates of errors in 
these qubits via post-selection, we use quantum telepor- 
tation [l| to replace the logical ancilla qubits with logical 
data qubits, as is also done in the work of Knill 14] and 
Reichardt [i£| . The simplest teleportation is to use a Bell 
pair (two qubit cat state) 



^(|00) + 111)) 



and apply the standard teleportation gate given in Fig. 
m If 16) represents the standard basis elements of Ji/'), 
then we start with \b) \a) \a). After CNOT is applied from 
the first to the second qubit, it becomes 16) la ©6) \a). 
Measuring the second qubit results in 16) 16) if is mea- 
sured, and 16) © 16 © 1) if 1 is measured. Applying X 
to the last qubit if 1 is measured results in 16) 16). The 
last measurement and conditional gate gives either 16) or 
the initial ]■)/') state. The three steps in Fig. 0] can be 
represented as 

16) la) la) ^16) la ©6) la) ^16) 16) ^16). 

While this both inputs and outputs l^/i), the advantage 
is that the output can be a post-selected logical qubit, 
which can be used to correct errors in the physical qubits 
that comprise an arbitrary logical data qubit. 

Now, if we have the 3 qubit cat state 



1 



V2 



(1000) -^1111)) 



-L5:iaaa) 

^ a— 



FIG. 4: Teleportation circuit 

1^) (x) 



35(|00> + 111» 



V2 



X 
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and apply a similar process shown in Fig. [5l then after 
each of the 3 steps, we have 

\b)\a)\a)\a)^\b)\a(Bb)\a)\a)^\b)\b)\b)^\b)\b). 

The effect of this is to send |0) to |00) and |1) to 
with freshly post-selected qubits. 

In the Hadamard basis, the 3 qubit cat state is trans- 
formed to 

i(|000) + 1011) + 1101) + 1110)) = ^ E E l«) 1^) l« ® ^) ■ 

a=0 6=0 

In Fig. [6l a circuit is shown that uses 2 teleportations to 
bring in two additional states with the basis elements |c) 
and Id). After each step in this circuit, this results in 

|c) \d) \a) \b) |a ® 6) -> |c) \d) \a ® c) \b) \a © b) 
|c) \d) \a ®c)\b® d) \a®b) \c) \d) \b ®d)\c® b) 
|c) \d) \c®d) \d) \c®d)^\c®d). 



If these two previous ancillas are used together, they 
give 

\a)\b)^\a)\a)\b)~.\a)\a(Bb), 

which is CNDT. Alternately, we can combine the two an- 
cillas via teleportation to get the 4 qubit state 



\a)\b)\a)\a®b) = i(|0000) 



11011) + 10101) + 11110)), 



and teleport \a) and |6) into the first and second qubits 
respectively as illustrated in Fig. [T] 

For CSS codes, we can assume that noise from a log- 
ical CNOT acting on two logical data qubits determines 
the fault tolerant thresholds. See section |B] for a discus- 
sion of the other gates necessary for universal quantum 
computation. 



A. Repeated post-selection 

This section discusses the process of repeated post- 
selection. By performing a large number of post- 
selections, the resulting noise decreases in magnitude, 
converging to some value. 



The process which uses ancilla qubits can be broken 
down into two parts. First, we run post-selection re- 
peatedly to refine ancilla qubits. This process works by 
detecting bit flip errors as in Fig. [U then switching to the 
Hadamard basis and again applying the circuit of Fig. [T] 
to detect what were originally phase flip errors. As post- 
selection is repeatedly applied, the noise will converge 
to some fixed channel that can be estimated by iterative 
calculation. Second, we use teleportation to bring a logi- 
cal data qubit into the ancilla qubits; simultaneously this 
applies a logical CNOT. 

a. Noise under post-selection When we are run- 
ning error detection with bit flip errors, we apply CNOT 
transversally from the source qubits to the destination 
qubits. Bit flip errors will propagate from the first to 
the second qubit, just like we want them to. However, 
phase flip errors will propagate backwards. To deal with 
this, we minimize the phase flip errors in the destina- 
tion qubits. The source qubits will also have been post- 
selected for phase flip errors in the previous step. There- 
fore, before we apply the CNOT, we can assume that there 
is the same diagonal noise on the source and destina- 
tion qubits. We let a = Sa = D^. Then using Eq. [HI the 
noise after CNDT followed by a post-selected measurement 
(measurement with no error) becomes 



y^Qxz 



Vout = xy 



Qyi 



Q 



YZ 



1 + 
Qzi 



iz 
Qzz 



In the next step, we run error detection on the other 
type of noise (bit flip or phase flip). This can be repre- 
sented by applying a Hadamard gate to change bit flip er- 
rors into phase flip errors and vice versa, and then apply- 
ing the same error detection process. This gives Xout = z, 
Zout = X, yout = y, yielding the map 



[x,y,z\ 



[-2^0Ut; yout; ^OUtJ 



(9) 



Since we just post-selected for bit flip errors, after the 
Hadamard there is a low rate of phase flip errors Pf = 
Px + Py- From Eq. [3l we see that a; = 1 — 2p/ is close 
to 1. 



FIG. 5: Teleportation splitting circuit. 

^ 



co|0>+ci|l) 
^(|000) + |111)) 



X — z 



X 



CO |00) +C1 111) 



FIG. 6: Teleportation merging circuit. 
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Under repeated post-selection, the noise converges to 
a solution [1, x, y, z] of 

(1 + z^Qizf = z{Qyi + Qyz){Qzi + Qzz) 

1 + z^Qiz 

X = 

Qyi + Qyz 

y'^Qxz = z{l + z^Qiz) - x^Qxi- 

This is the effective noise after infinity iterations of post- 
selection. 

h. Teleportation Once there has been sufficient 
post-selection on the logical source qubit, we apply CNOT 
from it to a logical destination qubit, and then measure 
the physical destination qubits in the normal Z basis 
(which detects bit flip errors) and the source qubits in 
the X basis (which detects phase flip errors) as shown in 
Fig. m ft is important that the two logical qubits which 
will have a CNOT applied between them are post-selected 
last for opposite types of errors, one phase flip, one bit 
flip. This results in bit flip errors and phase flip errors 
being equally probable — useful for CSS codes which are 
very inefRcient at correcting noise of only one of those 
types at a time. 

Therefore, Sy = Dy, Sx ^ Dz ^ x {or z) Sz ^ Dx ^ 
z (or x) . Consider a pair of physical qubits that are about 
to undergo an X measurement and a Z measurement as 
part of the teleportation circuit shown in Fig. [D A bit 
flip error {X or Y errors) in the destination qubit results 
in an X error. A phase flip error {Z or Y errors) in the 
source qubit results in a Z error. If both occur, we have 
a Y error; and if neither occur, then no error. Thus, the 
probabilities of Eq. [7| are 



PI 




"f 


f 


f 


f " 




f 


Px 


1 


1 


f 


-f 


-f 




mxzQxi 


Py 


~ 4 


f 


-f 


f 


-f 




m^y'^Qxz 


Pz. 




f 


-f 


-f 


f 




mxzQiz 



(10) 



where m is defined in terms of the probability of a mea- 
surement error in Eq. [8l 

Since we assume independent noise on each qubit, we 
can judge how well the quantum code corrects this noise 
by using the optimal recovery adaptive concatenation 



TABLE IV: The probability of each of the 16 two qubit Pauli 
errors resulting from a CNOT gate for depolarizing noise and in- 
dependent noise. Note that the probability of a measurement 
error pm is a separate parameter. The depolarizing noise used 
in this paper has no measurement errors; Knill noise is depo- 
larizing noise with a probability pm ~ j^p of a measurement 
error. Forward type noise has no measurement errors. 



Prob. 


Depolarizing 


Forward 


Independent noise 


pi I 


1-p 


(1 




{l~psf(l^p,f 


Pix 


p 

15 


(1- 


-Pf)Pf 


{1-Pf)pf{l~ptf 


PlY 


P 
15 


(1- 


-Pf)pf 


{1-Pf)pf{l-Pb)pt 


Piz 


P 
15 


(1 


-Pff 


(1 -P/)^(l -P6)Pi> 


Pxi 


P 
15 


(1 


-PS? 


(1 -P/)^(l -P6)Pi> 


Pxx 


P 
15 


(1- 


-~Pf)Pf 


{1-Pf)pf{l-Pb)pb 


PXY 


P 
15 


(1- 


-Pf)Pf 


{l-Pf)Pfpl 


Pxz 


P 
15 


(1 


-Pff 


i^-Pffpl 


Pyi 


P 
15 


(1- 


-Pf)Pf 


{1-Pf)pf{l-Pb)pb 


Pyx 


P 
15 




P) 


P/(l -'Pb)Pb 


Pyy 


P 
15 




P) 


2 2 
PfPb 


Pyz 


P 
15 


(1- 


-~Pf)Pf 


{l-pj)pfpl 


Pzi 


P 
15 


(1- 


-'Pf)Pf 


{1 ~ Pf)pf{l ~ Pif 


pzx 


P 
15 




P) 


P'fil-Pbf 


PZY 


P 
15 




P) 


-Pi)Pi 


pzz 


P 
15 


(1- 


-ps)ps 


(1 -P/)P/(1 -Pb)Pb 



TABLE V: Noise that gives the hashing bound (Shannon en- 
tropy of 1) under repeated post-selection. The pin is the input 
rate of errors for the types of noise as defined in this paper. 
The Pa are the probabilities of a Pauli error a on each qubit 
as given in Eq. 1101 Refer to Tab. IIVI for descriptions of 
the types of noise. These values were determined by numerical 
calculations. 





Depolarizing 


KniU 


Forward 


Pin 


8.27515% 


6.90240% 


4.81816% 


Px = pz 


7.13361% 


7.52699% 


9.79217% 


PY 


4.78136% 


4.12990% 


1.21061% 



technique of |7|. These are calculated in the next sec- 
tion. 



FIG. 7: CNOT gate via Teleportation. liZ-a) is l^-i) CNOT \jp2)- 



l^i> 



|a) 

\b) 

\a) 

|ae6) 



X 



X 



X 



i^i) 

1^3) 



IV. THRESHOLD CALCULATIONS FOR 
UNCORRELATED NOISE ON EACH QUBIT 

The probabilities p^-a-' of a Pauli error aa' for the types 
of noise resulting from CNOT that are discussed in this 
paper are given in Tab. IfVI 



A. Depolarizing noise 

Suppose that the CNOT gate has probability ^ of each 
of the non-identity two qubit Pauli errors. The total 
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TABLE VI: Fault tolerant thresholds in terms of pin assum- 
ing independent noise on each qubit. Refer to Tab. IIVI for 
descriptions of the types of noise. 



Code Depolarizing 


KniU 


Forward 


Hashing 8.2751% 


6.9024% 


4.8182% 


[[7, 1, 3]] 8.229(7)% 


6.864(5)% 


4.8036% 


[[17,1,5]] 8.2(0)% 


6.8% 


4.790% 


[[23, 1, 7]] 8.25% 


6.88% 


4.805% 


TABLE VIL Capacity of codes for {px,Py,Pz) 




Code 


(0,0,p) 


(p,P,p) 


Hashing 


11.0028% 


6.3097% 


[[7,1,3]] 


10.963(2)% 


6.270(4)% 


[[17,1,5]] 


10.927(0)% 


6.251 % 


[[23, 1, 7]] 


10.968% 


6.29% 


[[4,2,2]] and [[6,2,2]] 


10.9466% 


6.271(9)% 



probability of no error is therefore 1 — p. This results in 
Qaa' = 1 - ifp, except for Qu = 1. 

We look at two possibilities for measurement errors. 
We refer to the case where there are no measurement 
errors as depolarizing noise. In this case, Pm = and 
m — 1. KniU noise 0, [lB| refers to the case where there 
is probability Pm = -^p of a measurement error, which 
results in m = 1 — ^p. 

We first assume the same independent noise on each 
qubit, and calculate the resulting such noise from Eq. [TOl 
We find the noise corresponding to the Hashing bound 
for these noises in Tab. |Vl that is, when the Shannon 
entropy of the noise is 1. We then determine whether a 
given code can correct this noise under repeated concate- 
nation with itself by using the optimal recovery opera- 
tor method of Q , which considers syndrome information 
from previous levels to determine the optimal recovery 
operator at each level, and uses Monte Carlo simulation 
to estimate the threshold. The stabilizers for the codes 
studied in Tab. |V]are given in [Tlj . 

In Fig. [H we plot the effect of the measurement errors 
Pm — (that is, the fraction r of the KniU measure- 
ment error) on the fault tolerant threshold (assuming the 
hashing bound). 



B. Independent bit flip and phase flip noise 

In this section, we consider noise from the CNDT gate, 
where bit fiips and phase fiips act independently of each 
other, and the resulting noise on the source and destina- 
tion qubits are also independent of each other. Note that 
since bit fiip and phase fiip errors are independent, their 
error detection and correction are done independently of 
each other. The calculations in this section can be per- 



formed directly from Eqs. [9] and [TOl however, because 
they are simpler in the case discussed here, we will go 
into more detail. 

The probability of a phase error on the source qubit 
and the probability of a bit flip error on the destination 
qubit are both designated p/ . The probability of a bit flip 
error on the source qubit and the probability of a phase 
flip error on the destination qubit are both designated 
Pb- The probability of a measurement error is Pm- The 
probabilities of each of the Pauli errors for this type of 
noise are calculated in Table ITVl For example, since there 
is probability 1—pf of no bit flip error on the destination 
qubit, probability 1 — pf of no phase flip error on the 
source qubit, probability 1 — pb of no phase error on the 
destination qubit, and probability ph of a bit flip error on 
the source qubit, it follows that pxi = {l—pf)^{l—pb)pb- 

Let pg be the probability of a bit flip or phase flip 
error immediately after post-selection is performed for 
that type of error. Let pb be the probability of an error 
before that post-selection is run, i.e. after the other type 
of error detection is run. 

We use the noise notation from Appendix[Cl Before we 
run post-selection, we have the noise Xb = l — 2pb on both 
the source and destination qubits. After we apply CNOT, 
the destination qubit gains the noise / = 1 — 2pf repre- 
senting the probability of forward type error from CNOT. 
Measuring it results in the noise m = 1 — 2pm , represent- 
ing the probability of a measurement error. Therefore 
the total destination qubit noise is Xgfm. After post- 
selection, we must add the backwards type noise from 
CNOT to the source qubit. From Sec. O we have 



Xg ~b post{xb, Xbfm) — b 



Xb + Xbfm 



1 



When we run post-selection on the other type of noise 
(bit flip or phase flip), the Xg noise of this type back- 
actions from the ancilla qubits. In addition, the forward 
type noise contributes / noise, and so the total noise after 
the other type of error detection becomes 

Xb = X f = b fXf,{ . 2f ) 

before we run error detection on this type of noise (either 
bit flip or phase flip) again. 

After repeated error detection and post-selection, the 
noise will tend towards an equilibrium value, which we 
calculate iteratively. 

After the logical qubits have been sufficiently refined, 
we start the teleportation process given in Fig. [71 Ap- 
plying CNOT results in a back-action. To improve the 
threshold, we have the source and data qubits optimized 
for opposite types of errors, so the back-action together 
with the regular noise results in XgXb noise. In addition, 
the CNOT gate gives / noise, and the actual measurement 
gives m noise, resulting in a total combined noise from 
teleportation of 



X = XbXgfm 



xlfm. 



(11) 
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FIG. 8: The hashing bound for fault tolerant depolarizing noise. The CNOT gate has probability jr of each non-identity two 
qubit Pauli error occurring. The probability of a measurement error is pm ~ -^rp, where the r = 1 gives the measurement 
error used by Knill [T^ . 



For CSS codes, the measurement process detects both 
X and Z errors, and is equally good at correcting both. 
If there are independent probabilities p of a bit flip and 
a phase flip error, then (jjxtPYtPz) = (p — ^ , P — P^) ■ 
We consider for which types of noise the code can correct, 
assuming no additional errors during error detection and 
correction, as in [7]. For CSS codes, this has the same 
threshold as noise in the form {px ,Py ,pz) = (0, 0,p). In 
Tab. IVIIl we calculate the thresholds for various CSS 
codes; these are found to be slightly below the hashing 
bound of p = 11.0028% found in 0. 

We define forward type noise to consist of probabil- 
ity pf oi forward noise, with back noise probability ph 
and measurement noise probability p„i both 0. There- 
fore, n = m = 1 and f = 1 — 2pf. At the hashing 
bound, the noises are / — 0.90363682 (probability p/ = 
4.818159% of a forward type noise), Xg = 0.98482389 and 
Xb = 0.87641757. The combined hashing bound noise is 
c = XgXbf = xlP = 0.77994427, that is Pc = ^ is the 
hashing bound. 



V. ENTROPY-BASED CORRELATED NOISE 
CALCULATIONS 

In the previous parts of this paper, we assumed that 
there is the same local uncorrelated noise on each qubit. 
However, degeneracies in the code caused by low weight 
stabilizer elements will introduce correlated errors that 
are noisier than the noise predicted by the uncorrelated 



noise model. The most significant degeneracies occur at 
the first level of the code. In this section, we calculate 
the actual noise after post-selection at the first level (and 
sometimes look at the weaker effects at the second level) 
of the code. We use the entropy of this resulting corre- 
lated noise to evaluate the fault tolerant threshold. We 
then compare these exact results to those of the uncor- 
related case. We see how the differences result from the 
degeneracies that arise from the low weight stabilizer el- 
ements. 



Entropy of the [[7, 1, 3]] code with forward type 
noise 



The [[7, 1, 3]] code is discussed in Appendix [P] 

Since the bit flip and phase flip errors are corrected 
independently of each other for this type of noise, we 
only have to consider correction of the one type of error. 
There can either be no error detected, in which case there 
was either a distance or a distance 3 error, or there are 
7 equivalent syndromes corresponding to the detection of 
an error, in which case a distance 1 or a distance 2 error 
occurred. If p„ are the probabilities of distance n errors. 
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then the Shannon entropy is 

b0+P3)(/(^^) + /(^^)) 
PO +P3 P0+ P3 

+ {Pl+P2){f{^^) + fi^^)) 
Pi +P2 Pl+ P2 

= ^fiPn) - fiPQ +P3) - .f{Pl +P2), 
n 

where 

f{x) = -a;log2X. 

We use the resuhs of Sec. [D] to perform calculations 
in terms of the syndromes of the [[7, 1, 3]] code that are 
similar to that of Sec. IIVBI 

Tab. I VIII shows that the [[7, 1, 3]] CSS code can correct 
(0,0, p) type noise for p < 10.963(2)%. This noise has 
an entropy of ei — ^ 1.0020(5) after the first level, and 
62 il.0011(8) after the 2nd level. 

We use the processes described in this section to cal- 
culate what happens to one level of the [[7, 1,3]] code in 
terms of the probabilities of a distance d error, instead 
of assuming the same noise on each qubit. We calculate 
for which value of p/ the entropy after the second level 
of the code is the same 62 as before, in the uncorrelated 
case. This gives a threshold for forward type noise of 
Pf — 4.6700%. As a check, we note that the entropy 
after the 1st level was ^1. 0020(8), almost the same as 
before. By repeatedly concatenating the resulting noise 
for several levels, calculations have confirmed that this is 
indeed the new threshold. 



B. Analysis of the difference 

In the previous section, we calculate that the actual 
threshold for forward type noise for the [[7, 1,3]] code is 
Pf = 4.6700%. This compares to the original approxima- 
tion oipf = 4.8036% found in Tab. Ell 

The difference occurs because the code is slightly de- 
generate; there exist cases where the error that would be 
measured in the destination ancilla qubit corresponds to 
a stabilizer element, and therefore is not detected as an 
error. From Tab. IXIIIl it can be seen that the [[7, 1,3]] 
code has 7 stabilizer elements of weight 4. Thus, if there 
are 2 errors in the source qubits and 2 errors in the desti- 
nation qubits, there can be an undetected weight 2 error 
in the source qubits. For each stabilizer element, this has 
an approximate probability (2)^3 of occurring, where Pg 
is the probability of an undetected error from Sec. IIVBI 
Together these give a factor of 42pg difference. 

In Eq. [TTl we see that errors left undetected (and 
uncorrected) after post-selection contribute 3 times to 
the combined rate of noise analyzed for correctability by 
the code. Therefore, the total estimated difference in 
the probability of an encoded error (crash probability) is 

Ce = 126p2. 



TABLE VIII: Analysis of difference for [[7,1,3]] code. Pa 
are the actual thresholds found from analyzing the correlated 
noise resulting from post-selection at the first level of the code. 
(However, later we find that the threshold for the forward 
noise is Pa = 4.6699%.) Refer to Tab. II VI for descriptions of 
the types of noise. 



Value 


Depolarizing 


KniU 


Forward 


Pe 


8.229(7)% 


6.864(5)% 


4.8036% 


Pa 


8.1096% 


6.7785% 


4.6700% 


62 


1.0012(1) 


1.0011(9) 


1.0011(7) 


Ps 


5.58% 


4.69% 


5.93% 


Pd 


9.42% 


9.48% 


10.04% 


Pa 


0.61% 


0.51% 


0.70% 


Ce 


0.47% 


0.33% 


0.62% 


Ca 


0.39% 


0.32% 


0.72% 



If p/ = 4.6700%, then pg = 0.70%, and so the es- 
timated difference is Ce = 0.62%. To calculate the ac- 
tual difference, we understand that if there is the same 
noise x on each qubit, the resulting noise map for an en- 
coded crash is f7{x) = \x^ - jx"^ [1]. For p/ = 4.6700%, 
X = 0.78794(5), and ^(x) = 0.7147. For pf = 4.8036%, 
X = 0.78073(6), and frix) = 0.7002. The difference in 
crash probabilities is half the difference of //(x), and is 
Ca = 0.72%. 



C. Further calculations 

In Tab. IVIIIi we perform similar calculations for depo- 
larizing and KniU noise. Pe is the threshold estimated by 
assuming the same noise on each qubit. 62 is the logical 
entropy at the 2nd level of the code using this noise. Pa is 
the actual threshold found by fully post-selecting at the 
first level of the [[7, 1, 3]] CSS code. Using the same noise 
on each qubit with the actual probability of an error pa'. 
Ps and Pd are the probabilities of source and destination 
qubit errors immediately before post-selection, pg is the 
probability of an undetected error after post-selection. 
Ce — 126pg is the estimated difference in crash probabil- 
ities for a bit flip error {X or Y). Phase flip errors {Y 
or Z) have the same probabilities. Ca is the actual (not 
the estimated) difference in the crash probabilities due 
to the degeneracies from the stabilizer elements. 

In each of the cases, the noise at threshold for the 
actual non-local noise Pa is found to have the same 62 
as the Pe noise, with the same noise on each qubit. It is 
interesting to note that for all of these cases, the entropy 
at the 2nd level of the code, 62, is the same; therefore 
to determine the threshold for a particular type of noise 
for the [[7,1,3]], one just has to find when the entropy 
of the logical noise at the second level of the code is 
62 ^ 1.0011(8). 

So far we have only considered the non-local effects 
of post-selection from the first level. What about the 
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TABLE IX: Values at the hashing bound. Refer to Tab. |IV] TABLE X; [[23, 1, 7]] thresholds. Refer to Tab. |IV] for de- 
for descriptions of the types of noise. scriptions of the types of noise. 



Value Depolarizing Knill Forward Value Depolarizing Knill Forward 

p 8.27515% 6.90240% 4.81816% ~e 8.25(3)% 6.88(2)% 4.805(4)% 

Ps 5.65% 4.80% 6.18% Ce 0.00017 0.00009 0.00035 

Pd 9.56% 9.67% 10.40% Ap 0.003% 0.002% 0.0040% 

Pg 0.63% 0.54% 0.76% - - 4.800(3)% 

Pr 8.25(0)% 6.88(0)% 4.801(4)% 



second level? In addition to the stabilizers in each of the 
sub-blocks from the previous level of the code, there are 
new stabilizers that consist of an encoded X in 4 of the 
7 sub-blocks. The lowest weight for these is 12, which 
consist of distance and weight 3 errors in each of 4 sub- 
blocks. There are 7 of these stabilizers at the top level of 
the code, so there are 7 ways to choose the sub-blocks. 
Each of the sub-blocks has 7 elements of distance and 
weight 3. Therefore there are 7^ stabilizers of distance 
12. The estimated difference in the crash probability is 
^3x7^ CM. 

For the forward type noise Cg — 6.5 x 10 ^, which 
results in a difference in pf of less than 10~^, so the 
final actual threshold for the forward type noise for the 
[[7, 1,3]] code is pf = 4.6699%. The effect on the other 
types of noise is insignificant. 



These calculations were also performed for Knill and 
depolarizing noise in Tab. |Xl For the Forward type 
noise, pe — 4.805(4)%, which results in a total noise 
oi p — 10.968%, the threshold for one type of noise for 
the [[23,1,7]] code. The logical entropy after the first 
level is ei = 1.0016%. For the other types of noise, the 
values of pe were calculated by determining which noise 
gave this same logical entropy of ei after the first level. 
Based upon calculations with change in entropy near the 
threshold under different levels of concatenation for the 
[[7, 1, 3]] and [[17, 1, 5]] CSS codes, this method should be 
accurate to at most a couple times 10~^ for the entropy 
and a small fraction of 10~^ for the thresholds. 



E. Encoding and post-selection process 



D. The [[23,1,7]] CSS code 

The [[23, 1, 7]] CSS code has 506 weight 8 elements cor- 
responding to no error i.e., 506 weight 8 stabilizer 
elements. Using the method of the previous section, the 
estimated difference in crash probability is 



3( I506p^. 



For the forward type noise near the hashing bound, this 
results in Ce = 0.00035. 

Again, using the noise notation of Appendix [Cl the 
undetected distance 7 elements have weights 7, 11, 15, 
and 23 [13] , so the noise map for a logical crash is of the 
form f23{x) = cyx^ + cux^^ + ci^x^^ + C23X^^ . Since no 
error input results in no error output, /(I) = 1. Since 
this code can correct any distance 1, 2, or 3 error, it 
follows that /'(I) = /"(I) /'"(I) = 0. Solving these 
gives the channel map of 



, , , 3795 7 805 11 1771 

f23(x) = x' X H 

■'^^ ' 512 64 256 



X 



,15 



385 23 
512 ■ 



The crash probability is 



l-f23{x) 



For the same noise on 



each qubit, we found the threshold to be Pe = 4.805(4)%, 
and X = 0.78064. Solving for when the difference in crash 
probability is A = 0.00035, we get pr = 4.801(4)%, only 
lowering the threshold by 4 x 10~^ in P/- The actual 
threshold was calculated to be Pa = 4.800(3)%. 



p The methods described earlier in this paper rely on 
the code being concatenated many times with itself. A 
possible problem occurs if we create encoded |0) for the 
[[7, 1, 3]] code like in Fig. [9l and post-select as described 
in this paper, then the Z noise will converge to the com- 
pletely depolarizing case, where each of the 8 error syn- 
dromes are equally likely. The reason for this problem is 
that Z is now a stabilizer, and there are now 7 stabilizer 
elements of weight 3. 

To correct this problem, instead of post-selecting just 
one logical qubit after the first level of the [[7, 1, 3]] code, 
we first create a logical |0) state for the [[7, 1, 3]] code as 
in Fig. [9] and again in the Ha dam ard basis, so that we 
have the logical states |0) and |-f). We apply a logical 



FIG. 9: Creating encoded |0y for the [[7, 1, 3]] CSS code. 
This resuhs in the state jO) = ^([0000000) + [0001111) 
[0110011) + [0111100) + [1010101) + |1011010> + [1100110} + 
jllOlOOl)). 
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CNOT transversally from each of the physical qubits of the 
1+) state to the |0) state to create an encoded Bell pair 
^|00) + 1 11) that uses 14 physical qubits. Now there 
are no longer any stabilizer elements of weight 3. The 
7 weight 4 ones were discussed in Sec. IV Bl The next 
lowest weight stabilizers are the 49 weight 6 stabilizers 
that correspond to Z Z. 

Post-selection can now be used to refine this state. To 
post-select, we start with two Bell pair states. We apply 
logical CNOT from the first logical qubit of the first Bell 
pair to the second logical qubit of the second Bell pair. 
Similarly, we apply logical CNOT from the second logical 
qubit of the first Bell pair to the first logical qubit of the 
second Bell pair. When applying the logical CNOT gates, 
we mix up the orders of the qubits, to even out the errors, 
which is better for post-selection. There are 168 ways 
to permute the 7 qubits of the [[7, 1, 3]] CSS code and 
preserve the code, let p{x) represent such a permutation. 
The distance 3 codewords are those qubits qi , q2 and 173 
such that their parity is even, that is, Qi © (72 ® 93 = 0. 
The parity of p{qi) , p{q2) andp^q^) is also even, andp(l), 
p(2) and p(4) determine the permutation. 

We then measure the second Bell pair in the standard 
basis, rejecting the first Bell pair if an error is detected. 
We apply the Hadamard gate transversally to the re- 
maining (first) Bell pair, to exchange bit flip errors and 
phase flip errors for the next round of post-selection. No- 
tationally, we consider the initial encoding of the Bell 
pair to be the times post-selected states. To create 
the n times post-selected state, we use an n — 1 times 
post-selected state as the source (1st) Bell pair, and a dn 
times post-selected state for the destination (2nd) Bell 
pair, dn = 0, 0, 1,2,3,... is a well-performing sequence 
for post-selection. 

After a sufficient amount of post-selection is per- 
formed, we take 7 Bell pairs, and measure one of the 
logical qubits in the Hadamard basis on the 2nd qubit, 
rejecting if there is an error. We now have 7 post-selected 
encoded qubits. We use these qubits to create the |0) 
state for the [[7, 1, 3]] code encoded with itself (using a 
total of 49 qubits), and use more Bell pairs to detect if 
there are any errors after each step, rejecting everything 
if an error is detected. We then construct an encoded Bell 
pair, using 98 physical qubits, and repeat the process de- 
scribed above for post-selection, rejecting everything if 
there is an error detected in a single qubit. 

Once we have done this for a number of levels and the 
rate of errors is sufficiently low, we have two options. 
One is that we continue to use the [[7, 1, 3]] code, and we 
construct the more complicated encoded ancilla shown 
in Fig. [71 The other possibility is that we prefer to use 
some other code, such as the [[23,1,7]] CSS code. In 
this case, we now encode into that code. However, we 
wish to remove the levels of the [[7, 1,3]] CSS code from 
the encoding, since the [[7, 1,3]] code might not have as 
good a threshold (when we actually use error correction, 
not this post-selection process). We can decode one level 
at a time, using the following process for decoding the 



[[7, 1,3]] CSS code. Suppose we have the encoded state 
Co JO) -I- ci]l). We can represent this as 

mi = xi (B e m2 = X2 ® e 

ms = a;2 © a;i ® e 7714 = 3:4 © e 

= X4 (B xi (B e rriQ = X4 (B X2 (B e 
mr = X4 © a;2 © a;i © e, 

where xi, 0:2, 2:4 are summed over and 1, and e rep- 
resents the encoded basis states. We post-select it for 
Z errors. Now we measure the last 4 qubits, reject- 
ing everything and starting over if the parity isn't even. 
TO4©m5 — x\ and 7714 ©mg — X2,^o by applying X gates 
to the first 3 qubits conditional on these measurements, 
we have mi = ?7i2 = 7713 = e, which is cq [000) -I- c\ Jill). 
We then measure two of the qubits in the Hadamard 
basis. To avoid introducing more error, we can reject 
if either one is in the ]— ) state. For the forward type 
noise near the hashing bound, the total bit flip error is 
about 3pg w 2%, and the total phase flip error is about 
Ph ~ 6%. Once we have removed the lowest level of the 
[[7, 1, 3]] code, we run post-selection repeatedly, and then 
remove another level of the [[7, 1,3]] CSS code. Eventu- 
ally, we remove all of the levels of the [[7, 1, 3]] code and 
are left with a logical ancilla state in our desired code, 
which may be the [[23, 1, 7]] code concatenated with itself 
repeatedly. 



VI. DISCUSSION OF OVERHEAD 

In Sec. lYl we found that the degeneracies of a code 
cause the errors on each qubit to cease being indepen- 
dent of each other, thereby lowering the threshold a bit. 
In this section, we will discuss how using finite resources 
affects the thresholds. We pick some noise rate below the 
threshold and a rate of error that we are willing to tol- 
erate per logical gate, which results in a finite amount of 
overhead involved. We examine how the noise converges 
below the threshold. 



A. Post-selection overhead 

Suppose that there are q logical qubits encoded in an n 
qubit code, giving a total of N = qn qubits. If the total 
bit fiip noise on the destination qubits before measure- 
ment is p, then the probability of not rejecting everything 
is roughly pk ~ (1 — p)^ , and so we have to run this an 
average of — times. The process described in Sec. IV El 

Pk 

results in an overhead of Os = j^{ot-i+ot-2 after s post- 
selections. Then, roughly Os is o{N{p'^^ + 1 — PkY))- 
After a while, the noise will tend toward an equilib- 
rium point, getting a factor r closer each time. There- 
fore, to get within e of the equilibrium point requires s 
to be of the order log^ e. If N is large, to get within 
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TABLE XI: Fidelity the same at 0th and 1st levels of code. 
Refer to Tab. llVl for descriptions of the types of noise. 



Code 


Noise 


P 


fidelity 


[[7,1,3]] 


KniU 


3.472% 


0.90602 


[[7,1,3]] 


Depolarizing 


4.039% 


0.91122 


[[7,1,3]] 


Forward 


2.9595% 


0.87703 


[[23, 1, 7]] 


Forward 


3.5471% 


0.85108 



e of the equilibrium point will require something like 
Ar(p-^)i°S'-^ = iVe-A''°9.(i-p) overhead. 

At the hashing bound for forward type noise, p « 15.3, 
so for Bell pairs for the [[7, 1, 3]] code, the predicted prob- 
abihty of not rejecting is pk ~ {1 — p)^** = 9.79%. The 
actual pk = 9.21%. The observed rate of convergence is 
r = 0.766. This results in o{Ne-°-^^^^) overhead. This is 
exponential overhead in the number of qubits, and dou- 
bly exponential in the number of levels of concatenation 
of the code. 



B. Convergence by level 

Suppose we have a distance d code concatenated with 
itself / times, the initial entropy was t, and the threshold 
entropy was t^. Near the threshold, the entropy behaves 
roughly as t — (tc — t){d'')°', where a = logj logj e « 
0-53. 

Away from the threshold, if A = i't<:^i)<i — ^ ^j-^g ^.g^^g 

logical error has a magnitude of around 10"'^ or 10"** for 
A = 2, and goes to very rapidly after this. 

For example, suppose we have the [[7, 1,3]] CSS code 
concatenated with itself I = 4 times and our noise is of 
the form (0,0,p), and p is | of the threshold value of 
10.963(2)%. Then d = 3, = 0.49880, and t = 0.3492. 
Therefore, A « 2.5. The probability of an encoded error 
is 2 X 10"^. 

If the noise is some e below the threshold, then to get 
the noise below a given value requires that be o(e-" ). 
If the code encodes 1 qubit into n qubits, then the number 

log^ Tl 

of qubits n' is o(e 5 ). For the [[7. 1,3]] code, this is 

C. Fixed fidelity points 

Note that just below the threshold, the rate of an error 
increases for the first few levels of concatenation. Sup- 
pose we require that the rate of error be the same after 
the first level of the code as it was from the noise in 
Eq. H For the KniU type noise with the [[7,1,3]] CSS 
code, p = 3.472% has a fidelity of pi = 0.90602 in both 
cases. This can be thought of as a very crude proof that 
p = 3.472% is not above the threshold. Other values are 
calculated in Tab. IXII 



VII. CONCLUSION 

We find that for various CSS codes, particularly the 
[[23,1,7]] CSS code, when extremely large amounts of 
resources (i.e. remotely interacting ancilla qubits) are 
available, the threshold is close to that of the hashing 
bound (when the Shannon entropy of the noise is 1). 
We conjecture that the hashing bounds given in Table IVl 
represent an upper bound for fault tolerant thresholds. 
This bound is much lower than the best previously known 
upper bounds of 45% [3] and 29.3% [13]. It is higher than 
a fault tolerant threshold for the adversarial noise model 

The CSS codes are formed from classical codes and 
therefore do not exceed the Shannon hashing rating 
bound. The hashing bound had been conjectured to be 
the upper bound for non fault tolerant error correction, 
but it was found that codes can exceed this 0, 0, [H, 111] . 
However, the codes that exceed the hashing bound seem 
to do so because of their degeneracies, and these degen- 
eracies substantially hurt the threshold, as is seen in Sec. 
IVl The bit flip codes, which seem to yield the high- 
est capacities Q, have (2) ways of having Z errors on 
two qubits, which results in no error. This effect on the 
threshold is large enough that the threshold will be below 
the hashing bound. 

We conjecture that if CNOT is the only multiple qubit 
gate implemented directly, the fault tolerant hashing 
bound described in this paper serves as an upper bound 
on the fault tolerant threshold. As can be seen from 
Tab. Kl the [[23, 1,7]] CSS code provides high fault tol- 
erant thresholds, which are at least 99.6% of the hashing 
bound thresholds given in Tab. |Vl leaving little room for 
improvement. 

In the future, we hope to apply these ideas to ana- 
lyze the amount of overhead, and develop new realistic 
fault tolerant schemes. Unfortunately, it is unrealistic 
to get close to the fault tolerant thresholds given here. 
The resources needed to run quantum computing very 
close to these thresholds will be exceedingly high; to do 
so would require googols of universes with remotely in- 
teracting qubits. However, it would be useful to compare 
schemes with restricted amounts of overhead to the truly 
ultimate thresholds found in this paper. Since the thresh- 
olds found here are so high, achieving even some fraction 
of them with a reasonable amount of overhead may offer 
an improvement over current schemes. For example, we 
find a threshold of 6.9% for KniU noise, which is much 
larger than the 1% threshold with a reasonable amount 
of overhead found by KniU [161] . 
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APPENDIX A: CODE METHOD 

In this section, we use the quantum error correction 
channel formalism of Q to find the results of most of 
Sec. HI] by an alternate means. 

Suppose we have a code that encodes k qubits into 
n qubits. We have an operator £ which encodes 4*^ di- 
mensional density vectors representing density states on 
k qubits into 4" dimensional density vectors (as shown in 
Eq. [T]) representing density states on n qubits. The noise 
that occurs on these n qubits is represented by a 4" by 
4" dimensional operator Af. This is followed by measur- 
ing the stabilizer operators to determine the syndrome 
i. We next apply a recovery operator Ri and decode to 
evaluate the resulting logical errors. This is the channel 
map technique described in 0, H, . 

The resulting noise map for each recovery operator Ri 



1 



:E' oO{R^)oAf oE. 



The total noise map \s Q — Q^^ . 

Now we use these channel maps for a different way 
of looking at Sec. HT] If the physical data qubit was 
a |0) -I-/3 1 1) , we model this by having the 2 physical qubits 
in the state a |00) |11), apply CNDT, and then measure. 
represents no error and 1 represents an error, just like 
in Fig. [3l If this process is noise free, the data qubit is 
now a|0) -|-/3|1). 

In Fig. [3l we assumed that the source qubit has a 
pre-existing noise S, and the destination qubit has a pre- 
existing noise V. We then apply CNDT, and the noise Q 
is created. We follow this by measuring the destination 
qubit. 

We look at the two codes in Tab. IXIII CI is the two 
qubit bit flip code. By applying CNDT from its first qubit 
to its second qubit, we change it into the C2 code, and 
vice versa. 

These can be written out in matrix form. For example 
for C2, 



1001000000000000 
0000100100000000 
0000000010010000 
0000000000001001 



Let O(CNOT) be the superoperator for CNDT. Together 
with its noise operator, the faulty CNDT gate is Q o 
O(CNOT). Together with the pre-existing noise, we have 



Q o ©(CNDT) o 5 (g) P. 



(Al) 
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TABLE XII: Description of two codes 





CI 


C2 




|00) 


|00> 


ll> 


111) 


|io> 


s 


{77, ZZ] 


UI IZ\ 


Si 


II + ZZ 


77 + 7Z 


Ex 


XX + YY 


XI + XZ 


Sy 


XY + YX 


YI + YZ 


£z 


ZI + IZ 


ZI + ZZ 


R 


{II, IX} 


{II, IX} 



Note that this changes from code CI to code C2. If we 
wish to do the calculations in terms of code CI, we apply 
CNOT at the end. If we wish to do the calculations in 
terms of code C2, we apply CNOT at the beginning. The 
logical noise from these codes are 

TVci = ©(CNOT) o Q o ©(CNOT) oS(g>V 
Mc2 = Qo ©(CNOT) oS^Vo ©(CNOT). 

By conjugating, CNOT produces the following map c which 
acts on two qubit Pauli matrices as c(cr) as 

(//), {IX),IY ^ ZY, IZ ^ ZZ, XI ^ XX 
XY ^ YZ, XZ ^ YY, YI ^ YX, (ZI), (ZX). 

This map has order 2, that is, c{c{a)) = cr. Note that bit 
flips {X or y ) propagate forward as X, and phase flips 
{Y or Z) propagate backwards as Z. 



are encoded as themselves (or their adjoint) transversally 
[l0| . Measurement of a Pauli operator and CNOT are also 
encoded as themselves transversally. Together with any 
1 qubit pure non Pauli eigenstate, these give universal 
quantum computation [20,] . 

Transversely encoded gates are very useful to prevent 
propagation of errors. For a more in depth discussion of 
how these are useful for fault tolerance, see p7j . 

Typically, one prepares a "magic" state, :^(|0) + 

Vi\l)) or cos(|) |0) +sin(|) |1) that can be used to cre- 
ate the I gate. We discuss 3 approaches to get a "magic 
state" : 

• The method described by Nielsen and Chuang to 
measure the state using a cat state [J] . 

• There is a 15 qubit Hamming CSS code, where the 
^ gate can essentially be encoded as itself If 
there is a Bell pair consisting of the 15 qubit CSS 
code and some other code, this can be used to tele- 
port the logical qubit into another code. A similar 
method would be noting that the 15 qubit code is 
similar to the [[7, 1, 3]] CSS code, and converting it 
directly into that code. 

• We have a high distance code encoded in a Bell 
pair. One of the logical qubits is decoded into a 
physical qubit and then the ^ gate is applied 
to that qubit, followed by a measurement in the X 
eigenbasis. Then, the code is encoded into the 15 
qubit code, and post-selection is run on that as in 



1. Diagonal noise 



APPENDIX C: NOISE 



We now use the shorthand Afa = Ma. a- The initial 
source noise has diagonal noise 5 = [1, Sx, Sy, Sz], and 
similarly the destination noise has the diagonal noise 
V = [1, Dx, Dy, Dz]- Control can be used to make the 
noise diagonal, as was done with the Singular Value De- 
composition Theorem for channels [8,]. This turns one 
qubit unital noise into diagonal noise, and any non-unital 
part has little effect on the threshold. 

If there is no error detected, and a and a' are one qubit 
Pauli matrices, then 



N, 



aZ.a' Z ) 



G<7,a' — ^{XalM'I + Nal^a'Z — NaZ,a'I — NaZ,cj'z)- 

In the case of diagonal noise, we have Eq. [H 



In this section, we discuss our representation of noise, 
and derive a formula for the noise after post-selection. 

If there is probability Px of an error, we write the noise 
as X = 1 — 2px. 

Since x = (1 - 2py){l - 2p^) = 1 - 2{py+p^ - 2pypz), 
two noises y and z are combined as x = yz. 

Suppose a qubit with noise xi has CNOT applied to a 
qubit with noise X2, and post-selection is performed by 
measuring the 2nd qubit in the standard basis. We either 
have no error with probability pg = (1 — pi)(l — P2), or 
an undetected error with probability pb — P\Pi- The new 



probability of an error is then p 

post (xi, X2) = 1 - 
2 - 2pi - 2p2 



Vb 



Pg+Pb ' 



and so 



2p 



^ Pg ~Pb 
Pg +Pb 
Xi + X2 

2 - 2pi - 2p2 + 4pip2 1 + X1X2 ' 



APPENDIX B: OTHER GATES AND FAULT 
TOLERANT TRANSVERSAL GATES 



APPENDIX D: [[7,1,3]] CSS CODE 



For the doubly even (stabilizers have weights divisible 
by 4) CSS codes discussed in this paper, the Clifford gates 



In this section, we find non fault tolerant probabilities 
of errors for the [[7,1,3]] CSS code, which are applied 
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TABLE XIIL Distances d of errors of weight w (errors on 
w qubits) for the [7, 4, 3] classical code that generates the 
[[7,1,3]] CSS code. 





d = Q 


d=l 


d — 2 


d — 3 







1 











1 


1 





7 








7 


2 








21 





21 


3 





28 





7 


35 


4 


7 





28 





35 


5 





21 








21 


6 








7 





7 


7 











1 


1 



earlier for fault tolerant calculations. 

The [[7, 1,3]] CSS code is generated from the classical 
[7, 4, 3] Hamming code given by the parity check matrix 



1 1 1 1 
110 11 
10 10 10 1 



(Dl) 



The stabilizer generators for the [[7, 1, 3]] CSS code are 



gi = III XX XX 
g2 = IXXIIXX 
gs = XIXIXIX 



g4, = IIIZZZZ 
35 = IZZIIZZ 
ge = ZIZIZIZ 



The encoded Pauli operators are 

X = X®^ Z = Z^\Y = iXZ 



A circuit for generating the encoded |0) is given in Fig. 

m 



1. Probabilities of errors of a given distance 

Suppose we have the [[7,1,3]] CSS code, with uncor- 
related bit flip and phase flip noise. Then the two types 
of noise can be corrected independently of each other. If 
we wish to find the probability rrn of a distance i phase 
flip or bit flip error, we need just consider the [7, 4, 3] 
classical code that generates the CSS code. 

Looking at the parity check in Eq. IDll we see that 
there are 8 different possible error syndromes. The 
syndrome corresponds a distance error (no error) or 
distance 3 error (undetected error) . The other syndromes 
correspond to either a distance 1 or distance 2 error. The 
code corrects any one error. Counting the 2^ different 
possible errors, and their distances, we get Tab. IXIIII 

In terms of the quantum stabilizer code, distance er- 
rors (no error) correspond to members of the stabilizer 
group S, which preserve the codespace. Distance 3 er- 
rors are encoded errors that commute with the stabilizer 
elements, and so are in C{S) \ S, where the C represents 



centralizer. Distance 1 and 2 errors are errors outside of 
the codespace. 

If we have independent probability p of an error on 
each qubit, then the probability of a distance d error is 
md, where 

mo = {l-py + 7p\l-pf 
mi = 7p{l - pf + 28/(1 - p)4 + 21p5(l - pf 
m-2. = 21p2(l _ p)5 + 28/(1 - pf + 7/(1 - / 
m3 = 7/(l-/^+/. 
If we use X = 1 ~ 2p instead, we get 



(D2) 



Alternately, we see from the centralizer that mg = 

^'^'^^ le^ "'"'^ • This represents a distance error (no er- 
ror). What happens to each of the terms as we add an 
error of a flxed d? We flnd all the ways to add an error, 
and replace x with —x on that particular qubit, and then 
average the result. We get 



mo 




1 


7 


7 


1 




1 


mi 


1 


7 


7 


-7 


-7 




x^ 


1712 


~ 16 


7 


-7 


-7 


7 




x^ 


ms 




1 


-7 


7 


-1 




x' 



d = 


d = 1 


d = 2 


d = 3 


1 


1 


1 


1 




1^3 






x^ 
















x^ 






x^ 










x^ 


-x^ 


x'' 


-x' 
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For this code, an encoded error is a distance 2 or 3 
error. Since the syndrome is either distance or 3, and 
the other 7 error syndromes are each either distance 1 or 
distance 2, and the syndrome calculation is linear over 
FJ, we get 16 equivalence classes of 8 errors each. We 
can choose a representative element of each, for example 
all of the 0, 1, 6, and 7 qubit Z errors. We then calculate 
what happens when errors are combined by flnding the 
distance of the combined error. For example, if we pick 
2 random distance 1 errors, we have a j probability of a 
distance error, and a i probability of a distance 2 error. 
Then, if we combine a probability ad of having distance 
d errors with a probability bd of having distance d errors, 
we get a combined probability a'^ of having distance d 
errors 







bo 




¥^ 




¥^ 


b3 




ao 


a[ 




hi 


bo 


+ 1^2 


&3 




&2 




ai 


a'2 




b2 


63 


+ f6i 




+ 1^2 






a2 


."3. 




bs 




¥^ 






bo 




«3. 
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If we post-select on a and b, then we keep the result 
with probability 



1 



1 



Pk = aobo + -aibi + -0262 + 03^3, 



7 



7 



and the result is 



16 



1 

Pk 



aoba 
\a2b2 



(D5) 



